A Distributed-GPU Deep Reinforcement Learning System for Solving Large Graph Optimization Problems

Authors

Abstract

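Graph optimization problems (such as minimum vertex cover, maximum cut, and traveling salesman problems) appear in many fields, including social sciences, power systems, chemistry, and bioinformatics. Recently, deep reinforcement learning (DRL) has shown success in automatically learning good heuristics to solve graph optimization problems. However, the existing RL systems either do not support graph RL environments or do not support multiple GPUs in a distributed setting. This has compromised the ability of reinforcement learning in solving large-scale graph optimization problems, due to the lack of parallelization and high scalability. To address the challenges of parallelization and scalability, we develop RL4GO, a high-performance distributed-GPU DRL framework for solving graph optimization problems. RL4GO focuses on a class of computationally demanding RL problems, where both the RL environment and the policy model are highly computation intensive. Traditional reinforcement learning systems often assume either that the RL environment is of low time complexity or that the policy model is small. In this work, we distribute large-scale graphs across distributed GPUs and use spatial parallelism and data parallelism to achieve scalable performance. We compare and analyze the performance of spatial parallelism and data parallelism and show their differences. To support graph neural network (GNN) layers that take as input data samples partitioned across distributed GPUs, we design parallel mathematical kernels to perform operations on distributed 3D sparse and 3D dense tensors. To handle costly RL environments, we design a parallel graph environment to scale up all RL-environment-related operations. By combining the scalable GNN layers with the scalable RL environment, we are able to develop high-performance RL4GO training and inference algorithms in parallel. Furthermore, we propose two optimization techniques, replay buffer on-the-fly graph generation and adaptive multiple-node selection, to minimize the spatial cost and accelerate reinforcement learning. This work also conducts in-depth analyses of parallel efficiency and memory cost and shows that the designed RL4GO algorithms are scalable on numerous distributed GPUs. Evaluations on large-scale graphs show that (1) RL4GO training and inference can achieve good parallel efficiency on 192 GPUs, (2) its training time can be 18 times faster than the state-of-the-art Gorila distributed RL framework [34], and (3) its inference performance achieves a 26 times improvement over Gorila.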

Similar resources

A Distributed Reinforcement Learning Approach for Solving Optimization Problems

Combinatorial optimization is the search for one or more optimal solutions in a well-defined discrete problem space. Optimization methods are of great importance in practice, particularly in engineering design, scientific experiments, and business decision-making. In this paper we investigate a distributed reinforcement-learning-based approach for solving combinato...


Extended Distributed Learning Automata: A New Method for Solving Stochastic Graph Optimization Problems

In this paper, a new structure of cooperative learning automata, so-called extended distributed learning automata (eDLA), is introduced. Based on the proposed structure, a new iterative randomized heuristic algorithm is proposed for finding an optimal subgraph in a stochastic edge-weighted graph through sampling. It has been shown that the proposed algorithm based on the new networked structure can be used to solve the o...


Distributed Reinforcement Learning for Multiple Objective Optimization Problems

This paper describes the application and performance evaluation of a new algorithm for multiple objective optimization problems (MOOP) based on reinforcement learning. The new algorithm, called MDQL, considers a family of agents for each objective function involved in a MOOP. Each agent proposes a solution for its corresponding objective function. Agents leave traces while they construct soluti...


Extended Distributed Learning Automata: An Automata-based Framework for Solving Stochastic Graph Optimization Problems

In this paper, a new structure for cooperative learning automata called extended distributed learning automata (eDLA) is introduced. Based on the new structure, an iterative randomized heuristic algorithm using sampling is proposed for finding an optimal subgraph in a stochastic edge-weighted graph. Stochastic graphs are graphs in which the weights of edges have an unknown probability distribution. The pro...


GD-GIBBS: a GPU-based sampling algorithm for solving distributed constraint optimization problems

Researchers have recently introduced a promising new class of Distributed Constraint Optimization Problem (DCOP) algorithms that is based on sampling. This paradigm is very amenable to parallelization since sampling algorithms require a lot of samples to ensure convergence, and the sampling process can be designed to be executed in parallel. This paper presents GPU-based D-Gibbs (GD-Gibbs), whi...


Journal

Journal title: ACM Transactions on Parallel Computing

Year: 2023

ISSN: 2329-4949, 2329-4957

DOI: https://doi.org/10.1145/3589188